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Description 

The present invention relates to single-chain peptides having insulin activity useful for treatment of diabetes, to 
nucleotide compounds encoding said peptides, to vectors comprising said nucleotides compounds, and to cells trans- 

5 formed with said vectors. 

Although elimination or modification of the naturally occurring connecting peptide (or C-chain) of proinsulin may be 
expected to increase insulin activity relative to native proinsulin, none of these previously modified C-chain proinsulins 
have demonstrated the level of activity of the confounds of the present invention. The proinsulin structure has previ- 
ously been modified by incorporation of shortened C-diain fragments. However, these so called "mini-C" proinsulins 

10 have not demonstrated insulin activity greater than approximately 2% of the native insulin activity. 

The art suggests that a 2-chain insulin construction is preferred over a single-chain insulin construction. Yu and co- 
workers describe a "clipped" proinsulin molecule to form a two-chain cor^truction. Biological Activity of Proinsulin and 
Related Polypeptides in the Fat Tissues). Biol. Chem. 248:3753-3761 (1973). The authors demonstrate that an increase 
in insulin activity results from making a two-chain construct, suggesting that full insulin potency requires a twoK^hain 

15 molecule. Similarly. Peavy et al., demonstrate that dipped (2-chain) versions of proinsulin analogs possess insulin 
activity of less than 20%, but possess greater insulin activity than natural single-chain proinsulin molecules. J. BioL 
Chem. 260:13989-13994 (1985). Markussen, et M- (International Journal of Peptide and Protein Research 26:70-77 
(1985)) describe a molecule wherein the carboxyl terminus of the des(b30) insulin B-chain is linked to the amino termi- 
nus of the A-chain. In later work, this analog was determined to have a three dimensional crystal structure essentially 

20 identical with that of insulin, yet surprisingly retained essentially no insulin activity Derewenda, U. gt J- Mol. Biol 
(1991) 220:425-433. 

Single-chain proinsulin molecules having shortened C-chains have failed to demonstrate sufficient insulin activity 
to qualify as potential therapeutics. Wetzel, ei M- [Gene lfi.-63-71 (1981)] describe a "mini-C" proinsulin having a C- 
chain of six amino acids. This molecule is described as an insulin precursor which may be cleaved to form a two-chain 

25 insulin molecule. There is no discussion of ttie biological activity of the mini-C proinsulin described in tiiis reference. 
Furtiiermore. there is no suggestion that this molecule would have any insulin activity at all. It is merely being used as 
a precursor for generating insulin. TTie estimated biological insulin activity is approximately one-tenth of one percent 
(0.1%) that of insulin. Kubiak and Cowburn (Biochemical Journal (1986) 234:665-670) showed a construction with an 
insulin B-chain of 22 amino acids, an insulin A-chain of 21 amino acids and the full proinsulin C-chain. This molecule 

30 again demonstrated approximately 0.02% insulin potency, even less tiian that of natural sequence proinsulin. Hoechst, 
A.G., European Patent Publication No. 0347781 A2, demonstrates a mini-C proinsulin containing the A and B chains of 
insulin joined by a single arginine residue. There is no biological activity data presented in tills publication, despite tiie 
statement that this is useful for treating diabetes as a sut)stitute for human proinsulin. This reference is directed to the 
construction of an insulin precursor which may be cleaved to create natural sequence insulin from recombinant sources 

35 similar to the Wetzel ei M- reference. Similarly Kobayashi, el al- (Diabetes Research and Clinical Practice 725-28 
(1989)) demonstrate a proinsulin molecule witii a shortened C-chain of the construction b(1-29)-Ala-Ala-Lys-a(1-21) 
proinsulin. This molecule has approximately 0.2% of the activity of human insulin. This reference again further demon- 
strates that mini-C proinsulin molecules having shortened C-chains, especially approximately 3 amino acids, have very 
low insulin activity and would not be suitable as an insulin replacement. 

40 Therefore, the art demonstrates tiiat proinsulin molecules having shortened C-chains possess very little insulin 
activity. Similarly, proinsulin molecules having full lengtii C-chains, i.e., approximately 35 amino adds have very low 
insulin activity, usually only approximately 2% of insulin activity (note the data presented for hPI in a. Table 1, herein). 
Therefore, one would expect that a proinsulin-like molecule having a shortened C-chain or a proinsulin-like molecule 
v^ich has not been clipped, i.e., a single-chain insulin construct, would not be suitable as a tiierapeutic replacement for 

45 insulin. 

Industrial production of insulin from recombinantly-expressed, single-chain insulin precursors involves specific 
enzymatic cleavages. These enzymes add significant production costs for raw materials, process control, and purifica- 
tion. A single-chain peptide with signifcant insulin activity obviates these enzyme-related production costs and repre- 
sents an advancement in tiie art 

so Similarly, mammalian insulin symhesrs involves cleavages of a single-chain precursor catalyzed by highly specific, 
endogenous enzymes found only in pancreatic beta cells. A major challenge for gene therapy for freating diabetes is to 
provide for expression of the insulin precursor in cells having the specific endogenous enzyme activities required for 
proper maturation of the molecule. A solution to this challenge is a single-chain peptide with significant insulin activity. 
Such a single-chain peptide can be expressed in cells lacking the specific endogenous enzymes, thereby increasing 

55 dramatically the types of host c^ls useful for gene therapy of diabetes. 

The instant invention provides a single-chain polypeptide compound of the formula: 

b-BP-a (1) 
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wherein 

b is the B-chain of insulin or a functional analog thereof, 
a is the A-chain of insulin or a functional analog thereof, 
5 BP is a Bridging Peptide. 

The instant invention provides gene therapy methods and constructions allowing for the in vivo production under 
glucose-regulated control of a single-chain polypeptide compound of the Formula 1. 

Nucleotide sequences encoding tiie compounds of Formula 1 are useful in the treatment of both Type I and Type II 
10 diabetes. 

The present invention furtiier provides recombinant nucleotide vectors useful in the treatment of diabetes incorpo- 
rating a nucleotide sequence encoding a compound of Formula 1 . 

The present invention further provides recombinant host cells incorporating said vectors capable of seaeting the 
compound of Formula 1 in response to glucose stimulus. 
15 The present invention further provides a method of treating diattetes mellitus comprising the use of the recombinant 
nucleotide sequences encoding the compounds of Formula 1 . vectors containing said nucleotide sequences, and host 
cells transformed with said vectors. 

As previously indicated herein, the art provides a number of single-chain insulin compounds which fail to demon- 
strate significant or therapeutically useful levels of insulin activity. The following table provides comparative data of 
20 examples of the compounds of formula 1 in comparison to human proinsulln (hPI), human insulin, eind IGF-1 . Two dif- 
ferent assay procedures were employed as Table 1 indicates in vitro data determined in accordance with the procedure 
of Example 5 herein and Table 2 provides in yjye rat bioassay data which was performed in substantial accordance witii 
the teaching of Belagaje, MsL (United States Patent No. 5,304,473). 

25 

Table 1 



Construct^ 


%IQF-1 
Activity^ 


%lnsulin 
Activity^ 


a) hPI 


<0.1 


2.4 


b)des(31-53) hPI 


0.04 


14 


c)bCa 


23 


34 


d) human insulin 


0.19 


100 


e) IGF-1 


100 


.18 



^ all constructs use the naturally occurring 

human peptide sequences 

^ measured tiy procedure of Example 5 

herein. 



Table 2 



Relative In Vivo Rat Bioassay 


Data on a Molar Basis 


human prolnsulin 


ie% 


des(31-53)hPI 


62% 


human insulin 


100% 



55 A comparison of the bioactivity of human proinsulin with tfiat of native insulin demonstrates tiiat the C-chain ("c") is 
a suppressor of insulin activity. As previously discussed, the art indicates that a two-chain insulin molecule possesses 
signrf icantiy greater levels of insulin activity than any single-chain construct, indicating that a single-chain construct pos- 
sessing high insulin activity would be unlikely. Previous studies with mini-proinsulins have validated this concept where 
various connecting peptides have been used to link the insulin A and B-chairts with marked reductions in insulin 
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potency. The compounds of the present invention provide a single-chain insulin comprising the insulin A and B-diains 
resulting in a molecule which possess an unexpectedly, i.e. several fold, increases in both in vitro and in vjvQ insulin 
activities. 

5 BridQinq Peptide: 

A Bridging Peptide is defined for purposes of the present Invention as an amino acid sequence of from about 1 0 to 
about 14 amino acids regardless of amino acid sequence. Furthermore, the insulin activity is not determined by the 
amino acid sequence of the Bridging Peptide. If one compares the two examples of the compounds prepared in accord- 
10 ance with the teaching of the examples, one will note the lack of any sequence hcmology between the Bridging Peptide 
of the compounds. For example the Bridging Peptide of the molecules described in Table 1 have the following amino 
acid sequences: 

GLN-PRO-LEU-ALA-LEUOLU-GLY-SER-LEU-GLN-LYS-ARG des(31 -53)hPI 
GLY-TYR-GLY-SER-SER-SER-ARG-ARG-ALA-PR0<3LN-THR bCa 
15 However, as indicated in the results presented in Table 1 . both of these compounds demonstrate significant insulin 
activity. 

Preferably the Bridging Peptide should not contain amino acid sequences which interact with the IGF-1 receptor. 
Three residues (tyrosine and the tandem arginines) in the IGF-1 C-domain have been identified as being important for 
IGF-1 activity, i.e. interaction with the IGF-1 receptor. 
20 Preferably there should be an absence of cleavage sites within the Bridging Peptide to retain the single-chain 
nature of the protein, especially to intracellular E.odi proteases as well as human serum proteases. 

Preferably, the 12 amino acids are derived from an existing sequence of 12 residues from within the C-peptide of 
hP I so as to minimize potential immunogenicity, but is does not have to be any particular 12-mer from the sequence. 
In the most preferred practice of the invention as exemplified herein, an amino acid sequence of 12 residues is 
25 derived from the sequence of amino acids 54-65 of the human proinsulin C-peptide. 

A Chain and B Chain Functional Analogs 

The invention further provides functional analogs of the compounds of Formula 1 . The term "functional analog" 
30 refers to a molecule having similar functional properties but a modified structure relative to the naturally occun-ing form 
of that molecule or compound. Functional analogs include fragments of (or additions to) the parent molecule having 
similar functional properties and reactivities as the parent molecule. Such functional analogs typically exhibit the same 
qualitative biological activity as the naturally-occurring peptide, although functional analogs may also be designed 
which significantly modify the characteristics of a peptide. Functional analogs are ordinarily engineered variations, txit 
35 such functional analogs include naturally oocun'ing allelic or interspecies variations of the predominant naturally occur- 
ring amino acid sequence. 

Functional analogs of the compounds of formula 1 are generally created by modification of the amino acid 
sequence of insulin A or B chains in a specific and limited manner. While the site for introducing an amino acid 
sequence variation is predetermined, the mutation eer §e need not be predetermined. For example, in order to optimize 
40 the performance of a mutation at a given site, random mutagenesis may be conducted at the target codon or region and 
the resulting functional analogs screened for the optimal balance of desired activities. 

Functional analogs of the peptides are typically generated by deletion, insertion, or substitutions of a single (or few) 
amino add residues. Such modifications generally are made in accordance with the following Table 3: 

45 



50 



55 
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TABLES 



Original 
Residue 


Exemplary 
Substitutions 


Ala 


Ser 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Asn, Gin 


lie 


Leu. Val 


Leu 


lie. Val 


Lys 


Arg, Gin. Glu 


Met 


Leu. He 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


He. Leu 



Substantial changes in function and/or immunological identrty may be achieved by selecting sii>stitutions that are 
less conservative than those in Table 3. i.e., by selecting residues that differ more significantly in their effect on main- 
taining (a) secondary or tertiary structure of the polypeptide backbone, (b) the charge or hydrophobicity of the residue, 
or (c) the steric bulk of the amino acyl side chain. For example, substitutions which would ordinarily be expected to pro- 
duce significant alterations in protein properties will be those in which (a) a hydrophllic residue is substituted by a hydro- 
phobic residue; (b) a cysteine (involved in a disulfide linkage) or proline is substituted for any other residue: (c) a residue 
having a positively charged side chain is sii>stituted for a negatively charged residue; or (d) a residue having a bulky 
side chain is substituted for a residue having a much smaller side chain. 

A variety of amino acid modifications have been introduced into the primary structure of insulin insulin A and B 
chains resulting in insulin A-chain and B-chain functional analogs. These functional analogs have demonstrated a wide 
variety of desirable characteristics useful for treating various forms of diabetes, to facilitate commercial (especially 
recombinant) production, and/or to provide more desirable phamiaceutical formulations. A sample list of such modifi- 
cations to the primary structure of insulin and proinsulin is provided in Table 4 below. The instant invention includes 
compounds of Formula 1 incorporating such known nrxxlif ications to the structure of the insulin A- and 6-chains 
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Table 4 



Insulin A-Chain and B-Chain Amino Acid Substitutions 


Gly 


Glu 321 


Ser 321 


Thr bio 


Aspb25 


Ser 


Leu 321 


Gly 322 


Aspbio 


His b25 


Ala 321 


Met 321 


Ala 322 


Arg bio 


Glub26 


His 821 


Tyr 321 


Asp bg 


Ilebi2 


Glu b27 


Asp 321 


Val 321 


Asn bg 


His bi6 


Aspb28 


Thr 321 


ile 321 


Hisbg 


Gin bi7 


Al3 bao 


Gna2i 


Trpa2i 


Glu bio 


Gin b2o 


des-bso 


Prob29 


Lysb28 


Asn a2rNH2 







20 Furthermore, it will known in the art thst the first four and last five amino adds of the B-chain of insulin are generally 
not essential for insulin activity. Consequently, functional analogs of the B-chain as defined in formula 1 emconpass 
deletions and/or substitutions of the amino and/or carboxyl terminal residues of the B-chain. 

In til e preferred practice of the invention, as exemplified herein, the A- and B-chains are derived from human insu- 
lin. However, otiier embodiments of this invention include compounds derived from rabbit, monkey, horse, rat I, rat II, 
25 porcine, bovine, lamb, dog. guinea pig, chinchilla, or duck insulin molecules. Other embodiments of this invention msy 
be directed to functional analogs of the compounds of Formula 1 derived from tiie aforementioned species. 

In the preferred practice of the invention as exemplified herein, a corr^ound of the Formula 1 comprises the amino 
acid sequence: 

PHE-VAL-ASN-GLN-HIS-LEU-CYS-GLY-SER-HIS-LEU-VAL-GLU-ALA-LEU-T\R-L£U-VAL-CYS-^^^ 
30 PHE-PHE-TYR-THR-PRO-LYS-TTHR<3LN-PRO-LEU-ALA-LEU-GLU<3LY-SER-LEU-GLN-LYS-ARG-G^ 

GLU-GLN-CYS-CYS-THR-SER-ILE-CYS-SER-LEU-TYR-GLN-LEU-GLU-ASN-TYR-CYS-ASN (Sequence ID No. 1) 

In the preferred practice of the invention as exemplified herein, another compound of tiie Formula 1 comprises the 
amino add sequence: 

PHE-VAL-ASN-GLN-HIS-LEU-CYS-GLY-SER-HIS-LEU-VAL-GLU-AU^-LEU-TYR-LEU-VAL-CYS-GLY<3 
35 PHE-PHE-TYR-THR-PRO-LYS-TTHR<3LY-TYR-GLY-SER-SER-SER-ARG-ARG-ALA-PRO<3LN-^^ 

GLU-GLN-CYS-CYS-THR-SER-ILE-CYS-SER-LEU-TYR-GLN-LEU-GLU-ASN-TYR-CYS-ASN (Sequence ID No. 2) 

Synthesis: 

40 The conpounds of the present invention may be produced either by recombinant DNA technology or well known 
chemical procedures, such as solution or solidi^hase peptide synthesis, or semi-syntiiesis in solution beginning witii 
protein fragments coupled tiirough conventional solution methods. The synthesis of the conpounds of tiie present 
invention may proceed by solid-phase peptide syntiiesis or by reconrtoinsnt methods, however recombinant methods 
are preferred if a high yield is desired. 

45 

A. Solict Ph3se: 

A compound of Sequence ID No. 2 was synthesized by solid-phase metiioddogy utilizing an Applied Biosystems 
430A peptide synthesizer (commercially av3il3ble from Applied Biosystems. Foster City. Californid) using syntiiesis 
50 cydes sipplied by Applied Biosystems in subst3nti3l accordsnce with the teaching of examples 1-5 herein. 

B, Recombinant DNA: 

The compounds of Formula 1 may also be produced by recomtHnant methods. The basic steps in the recombinant 
55 production of insulin analogs of tiie formula 1 indude: 

a) constructing a synthetic or semi-synthetic DNA encoding the compound of the formula 1 , 
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b) integrating said DNA Into an expression vector in a manner suitable for the expression of the hybrid insulin ana- 
logs of the formula 1, 

c) transforming an appropriate eukaryotic or prokaryotic host cell with said expression vector. 

5 

d) culturing said transformed or transfected host cell, and 

e) recovering and purifying the recombinantly produced compound of the formula 1 . 

10 The coding sequence for recombinant preparation of the compounds of the formula 1 may be wholly synthetic, 
semi-synthetic or the result of modification of the native Insulin cDNAs. DNA sequences designed for recombinant pro- 
duction insulin A and B chains and proinsulln are well known. United States patent No. 4,431,740 (issued February 14, 
1984) describes recombinant production of human prolnsulin. United States Patent No. 4,366,246 (issued Decent)er 
28, 1982) describes DNA sequences encoding and the recomtiinant production of human insulin A-chain and B-chain. 

15 Synthetic genes, the in vitro or in vivo transcription and translation of which will result in the production of compounds 
of the Formula 1 may be constructed by techniques well known In the art. Owing to the natural degeneracy of the 
genetic code, the skilled artisan will recognize that a sizable yet definite number of DNA sequences may be constructed 
which encode the compounds of the Formula 1. 

As exemplified herein a synthetic DNA gene of the sequence: 

20 ATG CGT ATG TTT GTT AAC CAA CAC CTG TGC GGC TCC CAC CTG GTG GAA GOT CTG TAG CTG GTG TGC 
GGT GAA CGT GGC TTC TTC TAC ACC CCG AAG ACG CAG CCG CTG GCC CTC GAG GGT TCC CTG CAG AAG 
CGT GGC ATT GTG GAA CAA TGC TGT ACC AGC ATC TGC TCC CTG TAC CAG CTG GAG AAC TAC TGC AAC 
TAG (Sequence ID No. 3) 

was used for the recombinant production of the compound of formula 1 having the amino acid sequence: 
25 Mel-Arg-Met-PHE-VAL-ASN-GLN-HIS-LEU-CYS-GLY-SER-HIS-LEU-VAL-GLU-AUV-LEU-TYR-LEU-VAL-CYS-GL^^ 
GLU-ARG-GLY-PHE-PHE-TYR-THR-PR0-LYS-THR-GLN-PR0-LEU-AU\-LEU<3LU-GLY-SER-L£U-Gm-L^^^ 
GLY-ILE-VAL<3LU-GLN-CYS-CYS-THR-SER-ILE-CYS-SER-LEU-TYR-GLN-LEU-GLU-ASN-TYR-CYS-ASN 
(Sequence ID No. 4). 

The synthetic DNA sequence identified was prepared by techniques well known in the art in sut}stanljal accordance 

30 with the teachings of Brown, et al- (1979) Methods in Enzymology, Academic Press, N.Y., Vol. 68. pgs. 109-151. The 
DNA sequence was be generated using converrtional DNA synthesizing apparatus such as an Applied Blosystems 
Model 380A or 380B DNA synthesizer (commercially available from Applied Biosystems, Foster City, California). 

Furthermore, direct expression of DNA compounds including compounds of Formula 1 in procaryotic cells results 
in a compound possessing an amino terminal methionine residue. This Met may be removed by the action of cyanogen 

35 bromide (CNBr) where the particular compourxi does not othennnse possess an internal methionine residue. Alterna- 
tively, a Met-less compound may be obtained by engineering an introductory leader sequence which may be removed 
by limited proteolytic digestion (e.g. trypsin, dlamlnqpeptidases such a Cathepsin C (U.S. Patent No. 5,126,249 issued 
June 30, 1992 and European Publication No. 0557076A1 published August 25, 1993), the diaminopeptldase of Dictv- 
ostelium discoideum (European Publication No. 0595476A2 published May 4, 1994) or enterokinase). 

40 In the practice of the invention as exemplified herein, the compound of formula 1 identified as Sequence ID No. 1 
was produced first as a Met-Arg-Met-des(31-53)hPI precursor (Sequence ID No. 4). TTie Met-Arg-Met amino terminal 
sequence is provided for the convenient removal of the characteristic amino terminal methionine resides of recombinant 
proteins through the use of CNBr in substantial accordance with the teaching of example 6 herein. 

Construction of suitable vectors containing the desired coding and control sequences employ standard ligation 

45 techniques. Isolated plasmlds or DNA fragments are cleaved, tailored, and religated in the form desired to form the 
ptasmids required. To effect the translation of the compounds of formula 1, one inserts a DNA sequence encoding the 
compound into an appropriate recombinant DNA expression vector through the use of appropriate restriction endonu- 
cleases. The coding sequence is designed to possess restriction endonuclease cleavage sites at either end of the tran- 
script to facilitate isolation from and integration into these amplification and expression plasmlds. The coding sequence 

50 may be readily modified by the use of synthetic linkers to facilitate the incorporation of this sequence into the desired 
cloning vectors by techniques well known in the art. The particular endonucleases employed will be dictated by the 
restriction endonuclease cleavage pattern of the parent expression vector to be employed. The restriction sites are cho- 
sen so as to properly orient the coding sequence with control sequences to achieve proper in-frame reading and 
expression of compound of the Formula 1 . Furthermore, the coding sequence is positioned so as to be properly aligned 

55 with the promoter and ritx)some binding site of the expression vector, both of which are functional in the host cell in 
which the compound of the Formula 1 is to be expressed. 

In general, plasmid vectors containing promoters and control sequences. which are derived from species compati- 
ble with the host cell are used with these hosts. The vector ordinarily carries a replication site as well as marker 
sequences which are capable of providing phenotypic selection in transformed cells. For example, CQli is typically 
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transformed using pBR322, a plasmid derived from an E ooh species (Bolivar, et al., Gene 2: 95 [1977]). pBR322 con- 
tains genes for ampidllln and tetracycline resistance and thus provides easy means for identifying transformed cells. 
TTie pBR322 plasmid, or other microbial plasmids should also contain or be modified to contain promoters and other 
control elements commonly used in recombinant ONA construction. 

In the preferred practice of the invention as exemplified herein, when the host cell is E. otH K12 RV308. the pro- 
moter-operator region is the lambda pL promoter operator region. 

In the prefen-ed practice of the invention as exemplified herein when the host cell is an E. coN K12 cell, the expres- 
sion vector was pRB284 which contains a DNA sequence encoding the Met-Arg-Met^es(3 1 -53)hPI. The parent vector 
to this plasmid is the vector pCZRI 26S may be prepared according to the teaching of Belagaje. et al. United States Pat- 
ent No. 5,304,473 issued April 9, 1994, the entire teaching of which is herein incorporated by reference. As exennplified 
herein, the plasmid pC2R126S was digested with Ndel and BamHI to remove the bovine growth hormone gene 
encoded ther^y The plasmid pRB284 was constructed by inserting a double stranded DNA sequence encoding the 
Met-Arg-Met<les(31-53)hPI into the Ndel-BamHI site. 

The techniques of transforming cells with the aforementioned vectors are well known in the art and may be found 
in such general references as Maniatis, et a[- (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York or Current Protocols in Molecular Biology (1989) 
and supplements. In addition to prokaryotes, eukaryotic microbes such as yeast cultures may also be used. Saccharo- 
myces cerevisiae . or common baker's yeast is the most commonly used eukaryotic microorganism, although a nun^er 
of other strains are commonly available. For expression in Saccharomyces . the plasmid YRp7, for exanple, (ATCC- 
40053. Stinchcomb. ^ al.. (1979) Nature 282:39; Kingsman Mai- [1979] Gene Z:141 : Tschemper e! al-. [1980] Gene 
10:157) is commonly used. The metiiodology for transforming the E coli cell lines employed in the most preferred prac- 
tice of the invention may be obtained by reference to the Examples herein. The precise conditions under which the 
transformed E gq& cells are cultured is dependent on the nature of the E ^ host cell line and the expression or clon- 
ing vectors employed. For example, vectors which incorporate thermoinducible promoter-operator regions, such as the 
c1 857 thermoinducible lambdaiohage promoter-operator region, require a temperature shift in the culture conditions so 
as to induce protein synthesis. 

As exemplified herein, the plasmid denoted pRB284 was used to transform E. coN K12 RV308 to facilitate recom- 
binant production a bacterial E. ^ environment. The transformed host cell is then cultured to a sufficierrt cell density 
at which time the temperature is shifted from 37°C to about 42°C to initiate transcription from the c 1 857 thermoinducible 
lambda-phage promoter-operator region. 

The recombinantly produced protein is then isolated from the cell culture by well known purification procedures 
employing techniques such as centrifugation, ion exchange chromatography, size exclusion chromatography, RP- 
HPLC, etc. In the recombinant production of des(31-53)hPI as exemplified herein, the transformed E. coH host cells 
were harvested using centrifugation. The cells were lysed by the addition of lysozyme and the first stage isolation of the 
S-sulfonated form of Met-Arg-Met-des(31 -53)hPI compound was achieved by size exclusion chromatography. The Met- 
Arg-Met amino terminal leader sequence was removed by the action of CNBr to yield the des(31-53)hPL The S-sul- 
fonates of des(31-53)hPI molecules were converted to the disulfide-paired, folded des(31-53)hPI molecules using a 
cont)ination of high pH and added thiol in substantial accordance with the teaching of Frank. B.H., gial.. (1981) in Pep- 
tides . Svnthesis. Structure and Function . Proceedings of the Seventh American Peptide Symposium (Rich. D.H. and 
Gross. E. Eds.) pgs. 729-738, Pierce Chemical Co.. Rockford. IL. Subsequent rounds of solvent exchange and purifi- 
cation were achieved using column chromatography in a denaturing environment. The proteins were characterized for 
purity and identity by amino acid analysis, and Fast Atom Bombardment Mass Spectrometry (FAB-MS) with tiie 
expected results. 

Gene Therapy 

Many heritable diseases result from the at)sence of a functional gene necessary to provide the animal with an ade- 
quate supply of a necessary protein. Gene therapy attempts to replace or supply a functional gene encoding the protein 
using recombinant DNA or RNA vectors. Although there are a number of methods used to incorporate the desired DNA 
into the target cells, there are two primary procedures employed termed ©c vivo and in vivo therapy for the introduction 
of the gene therapy nucleotide sequence into the target animal. 

Ex vivQ therapy consists of four primary steps (1 ) the target cells are removed from the individual animal in need of 
therapy, (2) tiie gene therapy nucleotide sequence is incorporated into the target cell in vitro. (3) identification, isolation, 
and expansion of the transformed cells expressing the protein of interest and (4) replacement of the protein expressing 
cells into the animal. Ex yiyo therapy generally results in the incorporation of the nucleotide s^uence encoding the pro- 
tein of interest into the chromosomal DNA of tiie target cell. The essential st^. from a molecular biology standpoint, of 
idyQ gene therapy is tfie proper introduction of the nucleotide sequence encoding the desired protein into the target 
ceil. The introduction of the desired gene is generally accomplished by the use of viral vectors, although other methods 
are also applicakrfe. Two primary types of viral vectors are frequently used for tiie introduction of the foreign gene into 
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the target cell: (1) retroviral vectors arKi (2) adeno-associated viruses (AAVs). The choice of an appropriate retroviral 
vector is dependent on the existence of an appropriate viral receptor on the target cell. A variety of retroviral vectors 
appropriate for the transduction of a variety of cell lines have been described in the literature. The use of retroviral vec- 
tors is generally acconnplished by the use of "packaging cells" which permit the production of high titers of replication- 

s defective recombinant virus free of wild-type virus. [Miller, Human Gene Therapy, volume 1 , number 5 (1990)]. Adeno- 
associated viruses (AAVs) may also be used to efficiently transfer genes in vitro. Production of high titers of replication- 
defective recombinant virus has been described. [Flotte etal. Gene Therapy 229 (1995); Current Topics in Microbiol. 
Immunol m92 (1992), Kotin PNAS(USA) aZ:2211 (1990)]. 

In vivo gene therapy generally describes the introduction of a nucleotide sequence into an animal in need of ther- 

10 apy without previous isolation of a sample of target tissue or cells, jn \dyo therapy generally results in foreign nucleotide 
sequences being transferred to the nucleus of cells of an animal without subsequent integration into the genome. Ade- 
noviral vectors are frequently used to deliver DNA into the cells because these viaises are capable of infecting non- 
dividing celts and expressing foreign genes. Replication defective adenoviruses lacking portions of the El region of the 
viral genome can be propagated by growth in cells engineered to express the El genes (Shenk. Cell 1g:683 (1979)). 

75 Subsequent studies have indicated that replication defective adenoviruses carrying foreign DNA sequences could infect 
non dividing cells. Once inside the cells, the foreign DNA has been shown to express active protein [Becker et al., in 
"Protein Expression in Animal Cells" Methods of Enzymology", (1995)}. 

The treatment of human disease by the inrplementation of gene therapy approaches requires careful design. In par- 
ticular, certain disease states are associated with defective intracellular post-transtational processing mechanisms. 

20 Thus, a potential gene therapy approach to such diseases requires either (1) replacing or supplying the defective 
enzyme or enzymes responsible for post-translational processing of therapeutic proteins or (2) providing a synthetic 
analog of the native protein which is fully active yet not requiring post-translational processing. The present invention 
provides an application of the latter approach. Secondly, gene expression is generally a well regulated system in nature. 
Consequently, constituitive expression of the gene therapy nucleotide sequence is frequently unacceptable. 

25 Type I diabetes mellitus is associated with an inability of the body to produce functional insulin due to a lack of suf- 
ficient pancreatic beta cells. In Type II diabetes, although the individual is capable of producing the insulin precursor 
proinsulin, the body sometimes lacks the ability to fully process proinsulin to insulin. The process of insulin biosynthesis 
in pancreatic beta cells is well established. See e.g. Steiner D.F (1990) Handbook of Experimental Pharmacology Vol. 
92- Insulin, pp. 67-92, and Steiner, D.F. Ch. 1, The Biosynthesis of Biologically Active Peptides: a Perspectivive in Pep- 

30 tide Biosvrrthesis and Processing. (1991) pp. 1-15, L.D. Fricker, ed., CRC Press, NY. In biosynthesis of insulin in the 
animal, the insulin molecule is not immediately exported following translation but is rather stored in a hexameric zinc 
crystalline form in the pancreatic beta cells and released upon stimulation of the insulin receptor by extracelluar glu- 
cose. Consequently, effective gene therapy of diabetes mellitus should incorporate a mechanism of regulatory control 
in response to serum glucose levels. 

35 The in vivo biosynthesis of insulin involves two proteolytic cleavages of the human proinsulin (hPI) structure. For 
convenience, tiiese enzymes are termed endopeptidase 1 and endopeptidase 2. [Alternative nomenclature has been 
advanced by Hutton (Hutton. J.C. (1994) DiabetologicaaZ(suppl.2):S45-S56) and Halban (Halban, PA. (1994) Diabet- 
ologia ^(suppl.2):S65-S72), but for purposes of this discussion the present designation is sufficient.] Endopeptidase 1 
cleaves the hPI structure at the C/A junction resulting in a split form of proinsulin. The Arg-Arg residues at the C/A junc- 

40 tion are then removed by carboxypeptidase H. The des(31 ^2)hPI split form of proinsulin Is then the preferred substrate 
of endopeptidase 2. Since the compounds of Formula 1 do not possess the Arg-Arg structure characteristic of the C/A 
junction of native human proinsulin (hPI) they are not preferred substrates of endop^idase 1 and although the B/C 
junction structure of native hPI remains intact, the failure of the cell to cleave at the C/A junction disfavors proteolysis at 
the B/C junction of tiie compounds of Formula 1. Consequently, because the compounds of Formula 1 are substrates 

45 neither of the initial processing enzyme (endopeptidase 1) nor tiie secondary processing enzyme (endopeptidase 2), 
their single chain structure remains intact 

Nucleotides encoding the conpounds of Formula 1 generally are useful in the treatment of diabetes mellitus via 
gene therapy protocols. Thus nucleotide sequences encoding the peptides of the present invention are useful as tiier- 
apeutic agents in the treatment of Type I diabetes where tiie nucleotide sequences are incorporated into a vector com- 

50 prising (1) a mammalian regulatory region (enhancer and promoter) responsive to glucose with a mechanism of action 
analogous to the mammalian preproinsulin regulatory region. (2) a hybrid nucleotide fragment comprised of a Kozak 
consensus sequence linked to the sequence encoding tiie compounds of tiie present invention and (3) an intron and 
polyadenylation signal. 

Type II diabetes is generally considered to result from the afflicted individual lacking the ability to process preproin- 
55 sulin to insulin. As stated above, the compounds of the present invention do not require post-translational processing. 
Thus the incorporation of the nucleotide sequences encoding compounds of tiie present invention are superior to incor- 
poration of preproinsulin or proinsulin which would not work in such an instance. Using gene knock-out technology, tiie 
native preproinsulin gene on chromosome 11 of beta cells in the islets of Langeriians of type II diabetics could be 
replaced with the nudeotide sequences encoding tiie compounds of the present invention. The newly replaced gene 
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would have the advantage of use of the native transcriptional control sequences and would thus be regulated in the 
same manner as the endogenous gene. 

In the preferred practice of the invention as exemplified herein, a nucleotide sequence encoding the human pre- 
proinsulin signal peptide is linked to a DNA sequence encoding des(31-53)hPI and inserted into a eukaryotic expres- 
5 sion vector capable of integration into the target cell line, "mis eukaryotic expression vector is transfected into cells 
responsive to changes in the external glucose concentration within conventional physiological limits. These cells are 
then inplanted into an individual suffering from diabetes resulting in extracellular secretion of the des{31 -53)hPI analog 
in response to physiological nrxxlulations of blood glucose levels in a mechanism analogous to native beta cells. 

10 A) Constructinq A Synthetic Or Semi-Svmhetic Nucleotide Encoding Ihe Compound Of Tfte Fonrmila 1 

The coding sK^uence for the compounds of the Formula 1 may be wholly synthetic, semi-synthetc or the result of 
modification of the native insulin cDNAs. Synthetic genes, the in vitro or in vivo transcription and translation of which will 
result in the production of compounds of the Formula 1 may be constructed by techniques well known in the art. Owing 

15 to the natural degeneracy of the genetic code, the skilled artisan will recognize that a sizable yet definite number of 
nucleotide sequences may be constructed which encode the compounds of the Formula 1. The amino add sequence 
of human insulin, human pminsulin and human preproinsulin are well known in the art. 

A synthetic DNA sequence encoding the single-chain insulin analog of Formula 1 may be prepared by techniques 
well known in the art in substantial accordance with the teachings of Brown, gt al. (1979) Methods in Enzymology, Aca- 

20 demic Press, N.Y. Vol. gS. pgs. 109-151 . The DNA sequence may be generated using conventional DNA synthesizing 
apparatus such as an Applied Biosystems Model 380A or 380B DNA synthesizer (commercially available from Applied 
Biosystems, Foster City, California). Commercial services are also available for the construction of such nucleotide 
sequences based on the amino add sequence. 

The synthetic gene may also be constructed by assembly of sequences encoding fractions of plasmids containing 

25 sequences for the insulin A chain, insulin B-chain, human proinsulin, and human preproinsulin. Such plasmids are well 
known in the art. United States Patent No. 4,366,246 (issued Deoentoer 28. 1982) describes DNA sequences encoding 
and the recombinant production of human insulin A-chain and B-chain. United States patent No. 4.431 ,740 (issued Feb- 
ruary 14, 1984) describes cDNA encoding human proinsulin and human preproinsulin. 

In the preferred practice of the invention as exemplified herein a synthetic gene is assembled comprising a DNA 

30 sequence encoding the native human preproinsulin signal peptide followed by a DNA sequence encoding a compound 
of the Formula 1 having the DNA sequence: 

ATG GCC CTG TOG ATG CGC CTC CTG CCC CTG CTG GCG CTG CTG GCC CTG TGG GQA CCT GAC CCA GCC 
GCA GCC TTT GTT AAC CAA CAC CTG TGC GGC TCC CAC CTG GTG GAA GCT CTG TAC CTG GTG TGC GGT 
GAA CGT GGC TTC TTC TAC ACC CCG AAG ACG CAG CCG CTG GCC CTC GAG GGT TCC CTG CAG AAG CGT 
35 GGC ATT GTG GAA CAA TGC TGT ACC AGC ATC TGC TCC CTG TAC CAG CTG GAG AAC TAC TGC AAC TAG, 
which results in the expression of compound of Formula 1 (des)31-53)hPI with the native human preproinsulin signal 
peptide sequence, said compound having the amino add sequence: 

Met Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala Leu Trp Gly Pro Asp Pro Ala Ala Ala PHE VAL ASN GLN 
HIS LEU CYS GLY SER HIS LEU VAL GLU ALA LEU TYR LEU VAL CYS GLY GLU ARG GLY PHE PHE TYR THR 
40 PRO LYS THR GLN PRO LEU ALA LEU GLU GLY SER LEU GLN LYS ARG GLY ILE VAL GLU GLN CYS CYS THR 
SER ILE CYS SER LEU TYR GLN LEU GLU ASN TYR CYS ASN 

B) iNTEGRATING THE SYNTHETIC GENE INTO A VECTOR IN A MANNER SUITABLE FOR THE EXPRESSiON OF 
THE COIMPOUND OF FORiWULA 1 

45 

In gene therapy, there are alternative means for introdudng the genetic information necessary into the target cell. 
Retroviral Vectors: The use of retroviral vectors allows efficient introduction of genetic information into large numbers of 
cells. Retroviral vectors, espedally those derived from the Maloney murine leukemia virus (M-MLV), are particularly 
useful in the integration of foreign genetic material Into the host chromosome. In the construction of such vectors, the 

so nucleotide sequence encoding the desired compound of Formula 1 is inserted with its control region into the region of 
the retroviral structural gene region of the retrovirus (e.g. the gag, pol, and env genes). 

Retroviral regulatory elements (enhancer and promoters) are capable of conducting efficient expression of the 
transduced genes in a wide variety of target cell types. However, additional regulatory sequences may be added to 
achieve tissue spedf ic expression by the use of tissue specific enhancer and promoter elements. 

55 Alternatively adenoviral vectors may be employed. Jones and Shenk (1979) Cell 16:683, demonstrated that repli- 
cation deficient adenoviruses containing deletions of the El region of the viral genome could be propagated in cells 
engineered to express the El genes. Typical constructions of adenoviral gene therapy vectors are constructed by inser- 
tion of the foreign DNA into the El A-E1 B and E3 regions of the viral genome. Expression of the foreign DNA sequence 
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is generally under control of the El A promoter, the major late promoter (MLP) and associated sequences, the E3 pro- 
moter. 

Other viral vectors are known for their utility in the introduction of foreign nucleotide sequences for gene therapy 
applications. For example, derivatives of adeno-associated virus (AAV), herpes virus and vaccinia virus vectors have 

5 been demonstrated to have utility in gene therapy. 

As an alternative to viral introduction utilizes receptor-mediated methods of gene transfer. In such instances, a 
complex is formed between the target tissue receptor and the plasmid DNA and polypeptides and the reenter on the 
cell surface. The plasmid DNA is taken up by the cell in endocytotic mediated transfer. The plasmid DNA may then be 
released from the vesicle after incorporation into the cell. 

10 Construction of suitable vectors containing the desired coding and control sequences may be constructed by stand- 
ard ligation techniques. Isolated plasmids or nucleotide fragments are cleaved, tailored, and rellgated in the manner 
necesary to achieve the plasmids required. To effect the expression of the compounds of Formula 1, one inserts a 
nucleotide sequence encoding the compound into an appropriate recombinant nucleotide expression vector through 
the use of appropriate restriction endonucleases. The nucleotide sequence encoding the compound of Formula 1 is 

15 designed to possess restriction endonuclease cleavage sites at either end of the transcript to facilitate isolation from 
and integration into these amplification and expression plasmids. The coding sequence may be readily modified by the 
use of synthetic linkers to facilitate the incorporation of the coding sequence into the desired cloning vectors by tech- 
niques well known in the art. The particular endonucleases employed will be dictated by the restriction endonuclease 
cleavage pattern of the parent expression vector to be employed. Restriction sites are chosen so as to properly orient 

20 the coding sequence such that it is properly associated with the promoter and ribosome binding site of the expression 
vector, both of which are functional in the host cell in which the compound of the Formula 1 is to be expressed. 

In general, plasmid vectors containing promoters and control sequences which are derived from species conpati- 
ble with the host cell are used. The vector ordinarily carries a replication site as well as marker sequences which are 
capable of providing phenotypic selection in transformed cells. 

25 In the preferred practice of the invention as exemplified herein, the synthetic gene identified above as Sequence ID 
No. 3 is assembled from two smaller DNA sequences. The final segment is designed to possess a Hindi II restriction 
site at the 5' end and a Bam HI restriction site at the 3' end. The segment is inserted into the MCS (Multiple Qoning Site) 
of the 5400 bp eukaryotic expression vector pcDNA3 (commercially available from Invitrogen. 3985B Sorrento Valley 
Blvd. , San Diego, CA 921 21). Expression of the des(31 -53)hPI coding sequence is under control of the CMV immediate 

30 early promoter; whereas, the neomycin resistance gene is controlled by the SV40 early promoter. 

Target Cell Selection: 

Because the compounds of the present invention do not require post-translalional processing mechanisms, such 
35 nucleotide sequences encoding tiiese compounds may be incorporated into target cells which do not possess the pre- 
proinsulin and proinsulin processing enzymes, i.e. non-beta cells. Secondly, the compounds of tine present invention 
are smaller than preproinsulin and proinsulin and may be more stable in cell lines which do not naturally produce insulin. 
The techniques of transforming mammalian cells with tiie aforementioned vectors are well known in ttie art and may be 
found in such general references as Maniatis. glal. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Har- 
40 bor Press, Cold Spring Hartwr Laboratory. Cold Spring Harbor, New York or Current Protocols in Molecular Biology . Vd. 
1, (1988). Wiley Interscience. and supplements. 

As previously indicated, it is preferred that the secretion of tiie compound of Formula 1 from the host cell should be 
under glucose regulatory control. Cells naturally responsive to glucose are preferred as host cells for gene tfierapy. For 
example, beta islet cells are naturally responsive to external glucose levels. In tiie preferred practice of tiie invention as 
45 exemplified herein, the preferred host cell line for ©c vjyo implantation is ttie mouse pancreatic beta cell line, PTC6-F7 
(Knaack. gt al (1994) Diabetes 4^:1413-1417). 

Constituitive production of insulin is generally not preferred. However, analogs of the conrpound of Formula 1 hav- 
ing lower insulin activity may be constiturtively expressed to maintain a therapeutically effective b>asal level of circulating 
insulin activity. 

50 

FORMULATIONS 

Compounds of Formula 1 , preferably in the form of a pharmaceutically acceptable salt, can be formulated for oral 
or parenteral administration for the tfierapeutic or prophylactic treatment of diabetes mellrtus and/or non-insulin depend- 
55 ent diabetes mellitus (NIDDM). 

For exanple. compounds of ttie Formula 1 can be admixed with conventional pharmaceutical carriers and excipi- 
ents and used in tiie form of tablets, capsules, elixirs, suspensions, syrups, wafers and the like. The compositions com- 
prising IGF-1 analog conpounds will contain from about 0.1 to 90% by weight of the active compound, and more 
generally from about 10 to 30%. The compositions may contain common carriers and excipients such as corn starch or 



11 



EP0741 188 A2 



gelatin, lactose, sucrose, microcrystalline cellulose, kaolin, mannitol, dicalcium phosphate, sodium chloride, and alginic 
acid. Finkenaur (United States Patent No. 4,179,497) describes an aqueous medicinal composition comprising IGF-l 
and a water-soluble polysaccharide to stabilize the peptide against loss of biological activity. This water soluble polysac- 
charide may be, e.g. hydroxypropyi cellulose, methylcellulose. hydroxyethyl cellulose or hydroxypropyl methylcellulose. 

5 Disintegrators commonly used In the formulations of this invention include croscarmellose, microcrystalline cellu- 
lose, corn starch, sodium starch, glycolate and alginic add. Tablet binders that can be Included are acacia, methyl cel- 
lulose, sodium cartx>xymethylcellulose, polyvinylpyrrolidone (Povidone), hydroxypropyl methylcellulose, suaose, starch 
and ethylcellulose. Lubricants tfiat can be used include magnesium stearate or other metallic stearates, stearic add, 
silicone fluid, talc, waxes, oils and colloidal silica. Ravoring agents such as p^ermint, oil of wintergreen, cherry fla- 

10 vortng, or the like can also be used. It may be desirable to add a coloring agent to make the dosage form more attractive 
in appearance or to help identify the product. For intravenous (IV) use, a water soluble form of compounds of the For- 
mula 1 can be dissolved In one of the commonly used intravenous flukfs and administered by infusion. Such fluids, for 
example, physiological saline. Ringer's solution or 5% dextrose solution can be used. For intramuscular preparations, 
a sterile formulation of a suitable soluble salt form of the conpounds of the Formula 1 , for example the hydrochloride 

15 salt, can be dissolved and administered in a pharmaceutical diluent such as pyrogen-free water (distilled), physiological 
saline or 5% glucose solution. A suitable insoluble form of the compound may be prepared and administered as a sus- 
pension in an aqueous base or a pharmaceutically acceptable oil base, e.g. an ester of a long chain Mtty acid such as 
ethyl oleate. 

In some therapeutic applications, it may be desirable to achieve a prolonged release of the compound. Methods of 
20 incorporating the compounds of Fornrula 1 into such delayed release formulations are well known in the art and may be 
found in general texts on the subject. A preferred time release formulation of the compounds of the present invention 
includes the use of poly(alkylcyanoacrylate) particles, commonly known as nanoparticles, as can-iers for tiie compound. 
The use of nanoparticles is described in Grangier, J.L, et al "Nanoparticles as carriers for growth hormone releasing 
factor" (1991) J. of Controlled Release 15:3-13. Another preferred formulation of the compounds of the present inven- 
ts tion is the use of transdermal patches to provide continuous or discontinuous infusion of the compounds of tiie present 
invention in controlled amounts. The construction and use of transdermal patches for the delivery of pharmaceutical 
agents is well known in the art. Such patches may be constructed for continuous, pulsatile, or on demand delivery of 
pharmaceutical agents. 

Alternatively, tiie unit dosage form of the compound can be a solution of the conpound. preferably in its salt form. 
30 in a suitable diluent in sterile hermetically sealed anpoules. The concentration of the compound in the unit dosage may 
vary, e.g. from about 1% to about 50% depending on the particular form of the compound and its solubility and tiie dose 
desired by the physician. 

In practicing this method, compounds of the Formula 1 can be administered In a single daily dose or in multiple 
doses per day. The treatment regime may require administration over extended periods of time. The amount per admin- 
35 istered dose or the total amount administered will depend on such factors as the nature and severity of the disease, tiie 
age and general health of the patient and the tolerance of tiie patient to the compound. The method comprises admin- 
istering to the organism an effective anmunt the compound in a dose between about 1 and 1000 ^g/kg. A preferred 
dose is from about 1 0 to 1 00 ^g/kg of active compound. A typical daily dose for an adult human is from about 0.5 to 50 
mg. 

40 The pharmaceutical formulations comprising the compounds of Formula 1 described above may be administered 
to an organism for therapeutic effects. In a metiiod for treating diabetes mellitus and non-insulin dependent diabetes 
mellitus it is prefen-ed to administer to the organism an effective amount of hybrid insulin/IGF-1 analog in a dose 
between about 1 and 1000 jig/kg. A preferred dose is from about 10 to 100 ng/l^ of active compound. A typical daily 
dose for an adult human is from about 0.5 to 50 mg. 

46 

Example 1, Solid Phase PtePtide Synthesis of bCa 

The synthesis of this material (0.5 mMole scale) began with 0.38 g of Boc-Asn-PAM resin. The peptide was syn- 
thesized by solid-phase metiiodology utilizing an Applied Biosystems 430A peptide synthesizer (commercially available 
50 from Applied Biosystems, Foster City. California) and synthesis cycles supplied by Applied Biosystems. Boc amino 
acids and other reagents were supplied by Applied Biosystems and other commercial sources. Boc-His-BOM was 
obtained from Bachem California, Torrance, California. Sequential Boc chemistry using double coupling and acetic 
anhydride capping were applied to the desired Boc-amino acid-4-(oxymethyl)-phenylacetamidomethyl [PAM] resin. 
Asparagine, glutamine, histidine and arginine were coupled using preformed hydroxy benzotriazole esters. All others 
55 were made using preformed symmetrical anhydrides with docyclocarbodilmide (DCC). 
The following side chain protection is used: 

Arg, tosyl 

Cys, 4-methyIbenzyl 
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Gu, cyclohexyl 

His. benzyloxymethy! (BOM) 

Lys, 2-chlorobenzylc3xycarbonyl 

Ser, benzyl 

Thr, benzyl 

Tyr. 2-bromobenzyloxycaitx)nyl 

Boc deprotection was accomplished with TFA in methylene chloride. Following completion of the synthesis the pep- 
tide was deprotected and cleaved from the resin with 20 ml of anhydrous HP containing 10% meta-cresol. Cleavage of 
the side chain protecting groups and of the peptide from the resin were carried out at zero degrees centigrade for one 
hour After removal of the HP by vacuum distillation, the peptide/resin was washed with ethyl ether and vacuum dried. 

Example 2. Surtitoksis of bCa 

The peptide was solubilized in 100 ml of 8M guanidine HCI containing 0.2M tris, 0.1 M sodium tetrathionate. and 
0.1M sodium sulfite followed by filtration. The sulfitolysis reaction (pH 8.7) was maintained at room temperature for 
three hours, then dialyzed against 0.05M ammonium bicarbonate at 4*^0. The pH was lowered to 3.4 at 4**C for one half 
hour and the resulting isoelectric precipitate was collected by centrrfugation. 

The precipitate was dissolved in 0.1 M ammonium phosphate monobasic, pH 7. arxi purified on a 2.54 x 30 cm 
DuPont Zort)ax® C8 column using a linear gradient of a acetonitrile in 0.1 M ammonium phosphate, pH 7. The appro- 
priate sample pool was rechromatographed on a 2.21 x 25 cm Zorbax® C8 column using a linear gradient of acetonitrile 
in 0.1M ammonium bicarbonate. Appropriate fractions were combined and lyophilized on the basis of analytical HPLC. 

Example 3, Folding/Pisutfide Bond Formation 

12.5 mg of the lyophilized bCa S-sulfonated derivative from Example 2 were dissolved in 25 ml of 20mM glycine. 
pH 9, containing 25 ^il of 0.5M cysteine. After seven hours at 4°C. the reaction was quenched by acidification. 

Example 4. Purification of Folded bCa 

"me folded peptide from Example 3 was purified by multiple analytical runs (0.46 X 25 centimeter Vydac® C18 col- 
umn eluted with a linear gradient of acetonitrile in 0.1% aqueous TFA). The major peak from each run was combined 
and lyophilized to yield approximately 0.5 mg of purified protein. 

The structure was verified by FAB-MS and amino add analysis. Results of FAB/MS gave a molecular weight of 
7038 (theory 7036). Amino acid composition was as follows based on aspartic ackJ as unity (theoretical values in paren- 
theses): Asp 3.0(3); Thr 3.5(4); Ser 4.8(6); Glu 7.8(8); Pro 1.7 (2); Gly 7.7(6); Ala 2.0(2); half-Cys 4.8(6); Val 3.5(4); lie 
1.7(2): Leu 5.8(6); Tyr 4.4(5); Phe 2.7(3); His 2.0(2); Lys 1.0(1); Arg 2.8(3). 

Example 5. IGF-1 and Insulin Assav Procedure 

Measurements of insulin and IGF-1 activities were accomplished in substantial accordance with the teaching of 
Gruppuso. et al. (1988) J. of Clin. Endocrin. and Metab. 67:194-197. Briefly, 30-50 ^ig of human placental membrane 
protein was incubated with approximately 10 femtomoles of [^^^1] ligand in a final volume of 500 of 100 mM HEPES, 
pH 7.8, 120mM NaCI, 5mM KCI. 1 .2mM MgS04, 8mM glucose and 0.25% BSA for 24 hours at 4°C. Membranes were 
collected on glass fiber filters pre-treated with 0. 1% polyethyleneimine by using a cell harvester (Skatron, Uer, Norway) . 
Binding data were analyzed by fitting displacement curves to a four parameter model employing Prefrt/Allfit® software 
for the determination of EC50 values. 

Example 6. Recombinant Preparation of des(3l-53)hPI 

The synthetic DNA sequence identified herein as Sequence ID No.3 was prepared by techniques well known in the 
art in substantial accordance with the teachings of Brown, ei si- (1 979) Methods in Enzymology, Academic Press, NX , 
Vol. 68, pgs. 109-151 . The DNA sequence was be generated using conventional DNA synthesizing apparatus such as 
an Applied Biosystems Model 380A or 380B DNA synthesizer (commercially available from Applied Biosystems. Foster 
City, California). 

About 15mg of plasmid pCZR126S (the construction of which is may be found in Belagaje, et al. United States Pat- 
ent No. 5.304,473 issued April 4, 1994) was suspended in 20^1 of 10X Ndfil buffer, 5m1 of IMel restriction enzyme (40 
units) and 175^1 of water, gently mixed and incubated at 37'*C for 2 hours. After the incubation, the DNA was precipi- 
tated with three volumes of ethanol as above, dried and then resuspended in 20^1 of 10X BamHI buffer, 2.5^1 of 
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BamHI restriction enzyme (25 units) and 178^1 of water After gentle nrtixing, the reaction was incubated at 37**C for 2 
hours. The DNA was again precipitated with three volumes of ethanol and electrophoresed on a 1% low melting agar- 
ose gel. The larger fragment corresponding to the vector DNA was sliced from this gel and the DNA was recovered. 
After precipitation and drying the vector DNA was stored at 4**C in SSjil of 10mM tris-HCI pH 8.0. 

5 About 2.5 n\ of the vector DNA was mixed with 12 ^1 of the purified Met-Arg-Met-des(31-53)hPI gene fragment from 
above. 4 \i\ of lOmN ATP. 0.5 pi of 1M dithiolhreitol, 5 ^1 of lOX ligase buffer (SOOmM tris-HQ. pH 7.6, lOOmM MgCIa), 
26 \i\ of water and 0.5 of T4 DNA ligase (Pharmacia. Inc.. 800 Centennial Avenue, Piscataway. N.J. 08854, 3.5 units). 
Tlie reaction was incubated at 4*»C for 16 hours. The ligated mixture was diluted with 50ml of lOmM tris-HCI (pH 7.6) 
and 3 ^l of IM CaCl2 and then subsequently transformed into E. oo!i K12 R\/308. The cells were plated on T4 agar 

10 plates supplemented with 5 ^g/ml tetracycline and incubated overnight at 32*'C. 

Plasmids from 3 mL cultures were isolated from the tetracycline resistant colonies by the rapid alkaline extraction 
procedure described in Molecular Cloning: A Laboratory Manual . (1982) edited by Maniatis. T, Fritsch, E.F., and Sam- 
brook, J., CokJ Spring Hart)or Publications, New York, pgs.368-369- The presence of the correct Met-Arg-Met-des(31- 
53)hPI gene fragment was determined by the miniscreen procedure according to the teaching of Birnboim, H.C, Edoly, 

15 J. (1979) Nucleic Acids. Res. 1. 1513-1523, using polyaaylamide gel elctrophoresis to analyze the Xba l /Bam Hi 
digested fragment. Those plasmids with the correct size inserts were selected by amplification and purification. The 
expression plasmid containing the Met-Arg-Met-des(31-53]hPI gene is called pRB284. 

Production of cells for extraction and purification of recombinant Met-Arg-Met-des(31-53)hPI was accomplished 
using a BioFlo benchtop fermenter (commercially available from New Brunswick Scientific Co., Inc., RO. Box 986, 44 

20 Talmadge Road. Edison, NJ 08817). Five liters of 2X TY broth containing 5^g/ml tetracycline (obtained from Sigma 
Chemical Co.) plus 1 .0 ml of antrfoam SAG 5693 (commercially available from Union Carbide, Specialty Chemical Divi- 
sion. Danbury. CT 06817-0001) was inoculated with 100 ml of bacterial culture of E. eoli K12 RV 308 cells containing 
the pRB284 plasmid were grown overnight at 30*'C. Cells were grown at 32**C until the end of the exponential growth 
phase. Next, glucose and casein amino acids were added to concentrations of 0.2% and 0.1% respectively and the 

25 temperature shifted to 42°C to induce protein synthesis. The cells were harvested from the growth medium two hours 
post-induction by centrifugation at 500 x g for 10 minutes at 4''C. The supernatant was discarded and the pellet was 
washed once with ice cold TE buffer (lOmM Tris-HCI, pH 8,0, 1 mM EDTA). 

Expression and accumulation of Met-Arg-Met-des(31-53)hPI was determined by visualization of total cell protein 
following separation in a 10-20% polyacrylamide pore gradient gel in substantial accordance with the teachings of Lae- 

30 mmli. U.K. (1970) Nature (London), 227:680-685. Pelleted cells were lysed t>y the addition of modified sample buffer 
(0.1 25M Tris-HCI. pH=6.8, 2%SDS. 30% glycerol, 1M 2-mercaptoethanol, 6M urea) and boiled for 5 minutes prior to 
loading. Bands were detected by staining with Coomassie Blue and quantitated by scanning. 

Specific identification of Met-Arg-Met<les(31-53)hPI was determined by Western Blot analysis in substantial 
accordance with the teaching of Johnson, D.A., gt gl., (1984) Gene Anal. Tech. Vol. 1. pgs.3-8, using goat anti-HPI 

35 which recognizes the C-chain. followed by addition of a biotinylated second antibody (donkey anti-goat IgG) and visu- 
alization with the Vectastain^" protein detection kit (commercially available from Vector Laboratories, Inc., 30 Ingold Rd., 
Burlingame, CA 94010) in substantial accordance with the directions supplied by the vendor. 

The frozen homogenate, approximately 2 liters, from a 10 liter £. ^ fermention of Met-Arg-Met-des(31-53)hPI 
were thawed and centrifued at 4°C at 9000 rpm (13.700 x g) for 9-16 hours. The supernatants were discarded and the 

40 four pellets were kept frozen and individually processed through the following steps of solubilization and desalting. A 
fraction of the product was additionally processed through the follwing steps of RP-HPLC and lyophilization. 

The pellet was dissolved in 300 ml of 7M guanidine HCI. A Tlssuemizer (commercially available from Tekar) was 
used to oonpletely disperse the pellet. 

The sanple was recentrifoged at 9000 rpm (13.700 x g) for 30 minutes at 4<'C. The pellet was discarded. 

45 Tris, Na2S03 and Na2S406 were added to the supernatant obain a final concentration of 0.1M, 0.25 M and 0.1 M 
respectively The sample was S-sulfonated by stinging vigorously for approximately 30 minutes at room temperature. 

The S-sulfonated sample was solvent exchanged into 7M deionized urea. 0.02M tris. pH 7.9. using a 5 x 215 cm 
column packed with Pharmacia 01 0 Sephadex®. 

The main stream pool was purified by loading onto a 5 x 55 cm column packed with Pharmacia Fast Flow Q® using 

50 a linear gradient of 0 to 1 .0 M in sodium chloride in 7M deionized urea, 0.02M tris, pH 7.9 at 4'*C. The appropriate frac- 
tions were pooled and desalted over a 3.7 x 100 cm column of Phamiacia G25C Sephadex® using 0.05M ammoium 
bicart)onate at 4°C 

After addition of acetonitrile to the desalted pool to achieve 1 0% vN, the sarr^le was purified over a 5 x 60 cm CI 8 
reverse phase column using a linear gradient of acetonitrile in 0. 1 M ammonium bicartwnate. Appropriate fractions were 
55 pooled and lyophilized. 

646 mg of Met-Arg-MetKJes(31-53)hPi (S^ulfonated) were dissolved in 137 ml of 0.2M CNBr in 70% formic acid. 
The reaction was allowed to proceed at room temperature in the dark for 3 hours. 

1 60 ml of purified water were added to the sample which precipitated the product The sanrple was kept frozen then 
thawed and centrifuged at 2000 rpm (approximately 3000 x g) for one hour at 4'>C. The supernatant was discarded. 
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The pellet was disulfide bond folded by dissolving in 1 20 ml of 20mM glycine 0.5mM cysteine, pH 10.6, at 4<'C. The 
reaction was quenched after 24 hours by lowering the pH to 3.4 using phosphoric add. 

After addition of acetonrtrile to the product to achieve a 10% v/v, the sample was purified over a 2.12 x 25 cm 
DuPont Zorbax® C8 reverse phase column using a linear gradient of acetonitrile in 0.1 M pH 2.3 monobasic sodium 
s phosphate. 

Appropriate fractions were pooled and diluted 2 fold with purified water and additionally purified and desalted using 
a 2.12 X 25 cm Vydac® C18 reverse phase column and a linear gradient of acetonitrile in 0.1% TFA. Selected fractions 
of des(31-53)hPI were pooled and lyophilized. 

The structure was verified by FAB-MS and amino add analysis. Results of FAB/MS gave a molecular weight of 
10 71 1 1 .7 (theory 71 1 1.2). Amino add composition was as follows based on aspartic acid as unity (theoretical values in 
parentheses): Asp 3.0(3); Thr 2.8(3); Ser 3.4(4); G!u 10.0(10); Pro 1.9(2); Gly 5.0(5); Ala 2.0(2); half-Cys 4.6(6); Val 
3.4(4); lie 1.5(2); Leu 8.9(9); Tyr 3.7(4); Phe 2.9(3); His 2.7(2); Lys 2.0(2); Arg 1.9(2). 

EXAMPLE 7. Gene Therapy Vector Construction 

15 

The synthetic DNA sequences utilized herein may be prepared by techniques well known in the art in substantial 
accordance with the teachings of Brown, et al- (1979) Methods in Enzymology, Academic Press, N.Y, Vol. 68, pgs. 109- 
151 . The DNA sequences are generated using conventional DNA synthesizing apparatus such as an Applied Biosys- 
terns Model 380A or 380B DNA synthesizer (commercially available from Applied Biosystems, Foster City, California). 

20 A recombinant DNA sequence encoding the human pr^roinsulin signal p^de followed by DNA encoding des(31 - 
53)hPI analog is constructed in two sections. A first section encoding the signal peptide and the 5' end of the B^^hain 
is synthesized by conventional DNA synthesis techniques. This 1 1 5 base pair DNA fragment is designed to incorporate 
a Hind lll site at the 5' end and a Dralll site at the 3' end. The sequaice of this HInd lll-Dralll DNA fragment is as follows: 
AGCTTCCACC ATG GCC CTG TGG ATG CGC CTC CTG GCC CTG CTG GCG CTG CTG GCC CTG TGG GGA CCT 

25 GAC CCA GCC GCA GCC TTT GTT AAC CAA CAC CTG TGC GGC TCC CAC CTG. 

A second DNA fragment encoding the remainder of the human insulin B-chain, amino acids 54-65 of the human insulin 
C-peptide and the human insulin A-chain is constructed as a 171 base pair DNA sequence designed to incorporate a 
Drain restriction endonuclease cleavage site at its 5' end and a Bam HI restriction endonuclease cleavage site at its 3' 
end. The DNA sequence of this Dralll- Bam HI fragment is as follows: 

30 GTG GAA GCT CTG TAC CTG GTG TGC GGT GAA CGT GGC TTC TTC TAC ACC CCG AAG ACG CAG CCG CTG 
GCC CTC GAG GGT TCC CTG CAG AAG CGT GGC ATT GTG GAA CAA TGC TGT ACC AGC ATC TGC TCC CTG 
TAC CAG CTG GAG AAC TAC TGC AAC TAG. 

The Hind lll-Dralll fragment and the Dralll- Bam HI fragment are ligated resulting in a Hind lll- Bam HI fragment encoding 
the native human preproinsulin signal peptide, the human insulin B-chain, amino acids 54-65 of the human proinsulin 
35 C-peptlde, and the human insulin A-chain. 

Example 8, Vector Construction 

This Hind lll- Bam HI fragment prepared in substantial accordance with the teaching of Example 7 above is inserted 
40 between the UmdlH and BamHI sites in the MCS (Multiple Cloning Site) of the 5400 bp eukaryotic expression vector 
pcDNA3 (commerdally available from Invitrogen. 3985B Sorrento Valley Blvd.. San Diego, CA 92121). The resulting 
vector is termed phPldes(31 -53). Expression of the des(31-53)hPI coding sequence is under control of the CMV imme- 
diate early promoter; whereas, the neomydn resistance gene is controlled by the SV40 early promoter. 

45 Example 9. Transfection 

The recombinant vector phPldes(31-53) prepared in substantial accordance with Example 8 above is then trans- 
fected into the mouse pancreatic beta cell line, pTC6-F7 (Knaack, et al (1994) Diabetes 43:1413-1417, 1994) using 
standard lipid mediated transfection protocols. (See e.g. Current Protocols in Molecular Biology, reference specifics). 
so The cells are cultured in Dulbecco's modified Eagle's medium containing 25 mmol/liter glucose and supplemented 
with 15% horse serum and 2.5% fetal bovine serum at 37*'C in 5% CO2. 

Example 10. Estat)lishment of Monclonal Cell Lines 

55 Forty eight hours after transfection, cells containing phPldes(31-53) are selected using the antibiotic G418 
(Genetidn sulfate). After three to four weeks in medium containing G418, monodonal cell lines are established which 
secrete tiie des(31-53)hPI protein in response to physiological levels of glucose. 

Because pTC6-F7 cells release insulin in response to physiological concentrations of glucose, the des(31-53)hPI 
protein is secreted into the culture media when the cells are grown in the presence of 5-15 mmol/liter glucose. Identifi- 
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cation of those cultures producing the des(31'53)hPI protein are identified and quantitated by immunoassay using an 
antibody which recognizes the junction of the C-A junction of human proinsulin. A polyclonal antibody is commerdaily 
available from Linco Research, Inc.. St. Charles. MO 63304. 

5 Example 11. Implantation 

The cells prepared in accordance with the teaching of Example 10 are implanted by surgical procedure into an ani- 
mal suffering from diabetes mellitus type I or II. In response to physiologica) changes in glucose concentration, the 
des(31-53)hPI analog is released allowing modulation of serum glucose levels. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: Ell LILLY AND COMPANY 

(B) STREET; Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E» COUNTRY: United States of America 
{¥) ZIP; 462d5 

(ii) TITLE OF INVHVriON: SINGLE CHAIN INSULIN WITH HIGH BIOACTIVITY 

(iii) NUMBER OF SBQuENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: C. M. Hudson 

(B) STREET: Erl Wood Manor 

(C) CITY: Kindlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU2: 6PH 

(v) COMPUTER REAEABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

( B ) COMPUTER : Macintosh 

(C) OPERATIl^ SYSTEM: Macintosh 7.0 

(D) SOFTWARE: Microsoft Word 5.1 

(vi) CURRENT APPLXCATION DATA: 

(A) APPLICATION NO: 96303133.1 

(2) INFORMATION FOR SEQ ID NO: 1; 

(i) SEQUENCE CHARACTSRISTtCS: 

(A) LENGTH: 12 amino acids 
(B} TVPE: amino acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

55 Gin Pro Leu Ala Leu Glu Gly Ser Leu Gin Lys Arg 

15 10 
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(2) INFOHMATIOM FOR SBQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXSY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 2: 

Gly Tyr Gly Ser Ser Ser Arg Arg Ala Pro Gin Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Phe Val Aen Gin His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
15 10 15 

Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Gin Pro 
20 25 30 

Leu Ala Leu Glu Gly Ser Leu Gin Lys Arg Gly lie Val Glu Gin Cys 
35 40 45 

40 Cys Thr Ser lie Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 

50 55 60 



(2) INFORMATION FOR SEQ ID NO: 4: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

so 

(ii) MOLECULE TYPE: peptide 



55 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 4: 

Phe Val Asn Gin His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 
15 10 15 



Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Gly Tyr 
20 25 30 



Gly Ser Ser Ser Arg Arg Ala Pro Gin Thr Gly lie Val Glu Gin Cys 
35 40 45 



Cys Thr Ser lie Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cim 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATCCGTATGT TTGTTAACCA ACACCTGTGC GGCTCCCACC TGGTGGAAGC TCTGTACCTG 
60 

GTGTGCGCTG AACGTGGCTT CTTCTACACC CCGAAGACGC AGCCGCTGGC CCTCGAGGGT 
120 

TCCCTGCA6A AGCGTGGCAT TGTOGAACAA TGCTGTACCA GCATCTGCTC CCTGTACCAG 
180 

CTGGAGAACT ACTGCAACTA G 
201 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6; 

Met Arg Met Phe Val Asn Gin His Leu Cys Gly Ser His heu Val Glu 
15 10 15 



Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys 
20 25 30 



Thr Gin Pro Leu Ala Leu Glu Gly Ser Leu Gin Lys Arg Gly He Val 
35 40 45 



Glu Gin Cys Cys Thr Ser He Cys Ser Leu Tyr Gin Leu Glu Asn Tyr 
50 55 60 



Cys Asn 
65 



(2) INFORMATICS* FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGGCCCTGT GGATGCGCCT CCTGCCCCTG CTGGCGCTGC TGGCCCTGTG OGGACCTGAC 
60 

CCAGCCGCAG CCTTTGTTAA CCAACACCTG TGCGOCTCCC ACCTGGTGGA AGCTCTGTAC 
120 

CTGGTGTGCG GTGAACGTGG CTTCTTCTAC ACCCCGAAGA CGCAGCCGCT GGCCCTCGAG 
180 

GGTTCCCTGC AGAAGCGTGG CATTGTGGAA CAATGCTGTA CCAGCATCTG CTCCCTGTAC 
240 

CAGCTGGAGA ACTACTGCAA CTAG 
264 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 87 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Het Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala Leu 
15 10 15 

Trp Gly Pro Asp Pro Ala Ala Ala Phe Val Asn Gin His Leu Cys Gly 
20 25 30 

Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu Arg Gly Phe 
35 40 45 

^ Phe Tyr Thr Pro Lys Thr Gin Pro Leu Ala Leu Glu Gly Ser Leu Gin 

50 55 60 



20 



30 



40 



45 



50 



Lys Arg Gly He Val Glu Gin Cys Cye Thr Ser He Cys Ser Leu Tyr 
65 70 75 80 



Gin Leu Glu Asn Tyr Cys Asn 
85 



^ (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AGCTTCCACC ATGGCCCTGT GGATGCGCCT CCTGCCCCTG CTGGCGCTGC TGGCCCTGTG 
60 

GGGACCTCAC CCAGCCGCAG CCTTTGTTAA CCAACACCTG TGCGGCTCCC ACCTG 
1X5 



55 
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(2) INFORMATION FOR SEQ 13 NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GTQGAAGCTC TGTACCTGGT GTGCG0TC5AA CGTGGCTTCT TCTACACCCC GAAGACGCAG 
60 

20 CCGCTGGCCC TCGAGGGTTC CCTCCAGAAG CGTGGCATTG TGGAACAATG CTGTACCAGC 

120 

ATCTGCTCCC TGTACCAGCT GGAGAACTAC TGCAACTAG 
159 



30 Claims 



35 



1 , A polypeptide compound of the formula (1): 

b-BP-a (1) 

wherein: 



b is the B-chain of insulin or a functional analog thereof, 
40 a is the A-chain of insulin or a furK^tional analog thereof, and 

BP is a Bridging Peptide of from about 10 to about 14 amino acids. 

2. The compound of Claim 1 wherein b and a are tiie naturally occuring B-chain and A-chain of human insulin, respec- 
45 tively. 

3. The compound of Claim 2, wherein BP is about 1 2 amino adds. 

4. The compound of Claim 3 consisting essentially of the amino add sequence: 

50 Phe-Val-Asn-Gln-His-Leu-Cys-Gay-Ser-His-Leu-Val<3lu-Ala-Leu-Tyr-Leu-Val-Cys-Oly-Glu-Arg-a 
Thr-Pro-Lys-Thr<31y-Tyr<ay-Ser-Ser-Ser-Arg-Arg-Ala-Pro-Gln-Thr<5ly-lle-Val^ 
Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn. 

5. The compound of Claim 3 consisting essentially of the amino acid sequence: 

55 Phe-Val-Asn-Gln-His-Leu-Cys<3ly-Ser-His-Leu-Val<5lu-Ala-Leu-Tyr-Leu-Val-Cys<3iy-Glu^ 
Thr-Pro-Lys-Thr-Gn-Pro-Leu-Ala-Leu-Glu-Gly-Ser-Leu<aIn-Lys-Arg<3ty-lle-Val-Glu^ 
Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn. 



21 



EP0741 188 A2 



6. A pharmaceutical formulation comprising, as an active ingredient a polypeptide compound of formula I, or a phar- 
maceutically acceptable salt thereof, as claimed in any one of Claims 1 to 5, associated with one or more pharma- 
ceutically acceptable carriers thereof. 

5 7. A polypeptide compound of formula I. or a pharmaceutically acceptable salt thereof, as claimed in any one of 
Clainr^ 1 to 5, for use in treating diabetes. 

8. A nucleotide sequence encoding a polypeptide of formula 1 , as claimed In any one of Claims 1 to 5. 

10 9. The nucleotide sequence of Gaim 8 consisting essentially of the DNA sequence 

TTT GTT AAC CAA CAC CTG TGG GGC TCC CAC CTG GTG GAA GCT CTG TAC CTG GTG TGC GGT GAA 
CGT GGC TTC TTC TAC ACC CCG AAG ACG CAG CCG CTG GCC CTC GAG GGT TCC CTG CAG AAG CGT 
GGC ATT GTG GAA CAA TGC TGT ACC AGC ATC TGC TCC CTG TAC CAG CTG GAG AAC TAC TGC AAC. 

15 10. A recombinant nucleotide vector comprising a nucleotide sequence as claimed in Oaims 8 or 9. 

1 1 . A cell line transformed with a recombinant nucleotide vector of Claim 10. 

12. The cell line as claimed in Claim 1 1 for use in implanting the cell line for treating diabetes. 
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